On InChI and evaluating the quality of cross-reference links
Abstract
BACKGROUND There are many databases of small molecules focused on different aspects of research and its applications. Some tasks may require integration of information from various databases. However, determining which entries from different databases represent the same compound is not straightforward. Integration can be based, for example, on automatically generated cross-reference links between entries. Another approach is to use the manually curated links stored directly in databases. This study employs well-established InChI identifiers to measure the consistency and completeness of the manually curated links by comparing them with the automatically generated ones.

RESULTS We used two different tools to generate InChI identifiers and observed some ambiguities in their outputs. In part, these ambiguities were caused by indistinctness in interpretation of the structural data used. InChI identifiers were used successfully to find duplicate entries in databases. We found that the InChI inconsistencies in the manually curated links are very high (28.85% in the worst case). Even using a weaker definition of consistency, the measured values were very high in general. The completeness of the manually curated links was also very poor (only 93.8% in the best case) compared with that of the automatically generated links.

CONCLUSIONS We observed several problems with the InChI tools and the files used as their inputs. There are large gaps in the consistency and completeness of manually curated links if they are measured using InChI identifiers. However, inconsistency can be caused both by errors in manually curated links and the inherent limitations of the InChI method.
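The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the database contents, the entry identifiers (`A1`, `B1`, ...), the function names, and the exact definitions of inconsistency and completeness used here are assumptions made for illustration only. An automatically generated link is taken to be any pair of entries with identical InChI strings, a manually curated link counts as inconsistent when its two entries have different InChI strings, and completeness is the share of automatic links that also appear among the manual ones.

```python
from collections import defaultdict

# Toy data with hypothetical entry IDs; each database maps entry ID -> standard InChI.
db_a = {
    "A1": "InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3",  # ethanol
    "A2": "InChI=1S/CH4O/c1-2/h2H,1H3",          # methanol
    "A3": "InChI=1S/H2O/h1H2",                   # water
}
db_b = {
    "B1": "InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3",  # ethanol
    "B2": "InChI=1S/CH4O/c1-2/h2H,1H3",          # methanol
    "B3": "InChI=1S/H2O/h1H2",                   # water
}

# Hypothetical manually curated cross-references (entry in A, entry in B).
manual_links = {("A1", "B1"), ("A2", "B3")}


def automatic_links(a, b):
    """Link every pair of entries whose InChI strings are identical."""
    by_inchi = defaultdict(list)
    for entry_id, inchi in b.items():
        by_inchi[inchi].append(entry_id)
    return {(ia, ib) for ia, inchi in a.items() for ib in by_inchi.get(inchi, [])}


def inconsistency(manual, a, b):
    """Fraction of manual links whose two entries have different InChI strings
    (assumed definition)."""
    if not manual:
        return 0.0
    mismatched = sum(1 for ia, ib in manual if a[ia] != b[ib])
    return mismatched / len(manual)


def completeness(manual, auto):
    """Fraction of automatic links that are also present as manual links
    (assumed definition)."""
    if not auto:
        return 1.0
    return len(manual & auto) / len(auto)


auto_links = automatic_links(db_a, db_b)
inconsistency_rate = inconsistency(manual_links, db_a, db_b)
completeness_rate = completeness(manual_links, auto_links)
```

In this toy data the manual link from methanol to water is inconsistent, and two of the three InChI-based links have no manual counterpart. A real analysis would first have to generate the InChI strings from structure files, which is where the abstract reports tool-dependent ambiguities.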
Volume 6
Publication date: 2014